Selecting Genes by Test Statistics
نویسندگان
چکیده
Gene selection is an important issue in analyzing multiclass microarray data. Among many proposed selection methods, the traditional ANOVA F test statistic has been employed to identify informative genes for both class prediction (classification) and discovery problems. However, the F test statistic assumes an equal variance. This assumption may not be realistic for gene expression data. This paper explores other alternative test statistics which can handle heterogeneity of the variances. We study five such test statistics, which include Brown-Forsythe test statistic and Welch test statistic. Their performance is evaluated and compared with that of F statistic over different classification methods applied to publicly available microarray datasets.
منابع مشابه
Gene Selection for Multi-Class Prediction of Microarray Data
Gene expression data from microarrays have been successfully applied to class prediction, where the purpose is to classify and predict the diagnostic category of a sample by its gene expression profile. A typical microarray dataset consists of expression levels for a large number of genes on a relatively small number of samples. As a consequence, one basic and important question associated with...
متن کاملHykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data
MOTIVATION Recent studies have shown that microarray gene expression data are useful for phenotype classification of many diseases. A major problem in this classification is that the number of features (genes) greatly exceeds the number of instances (tissue samples). It has been shown that selecting a small set of informative genes can lead to improved classification accuracy. Many approaches h...
متن کاملHykGene: An Hybrid Approach for Selecting Marker Genes for Phenotype Classification using Microarray Gene Expression Data
Motivation: Recent studies have shown that microarray gene expression data is useful for phenotype classification of many diseases. In this classification problem, the number of features (genes) greatly exceeds the number of instances (tissue samples). It has been shown that selecting a small set of informative genes can lead to improved classification accuracy. Many approaches have been propos...
متن کاملFactors Affecting the Needs Assessment in Continuing Medical Education: Presenting a Practical Guideline for Selecting Models and Techniques
Introduction: The dispersal of educational needs assessment resources in one hand and different advantages and disadvantages attributed to each of needs assessment models and techniques on the other hand, make it hard to choose an appropriate, accurate, and reliable plan for needs assessment. This study was performed to develop a practical guideline for selecting a need assessment model and tec...
متن کاملA versatile gene-based test for genome-wide association studies.
We have derived a versatile gene-based test for genome-wide association studies (GWAS). Our approach, called VEGAS (versatile gene-based association study), is applicable to all GWAS designs, including family-based GWAS, meta-analyses of GWAS on the basis of summary data, and DNA-pooling-based GWAS, where existing approaches based on permutation are not possible, as well as singleton data, wher...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Biomedicine and Biotechnology
دوره 2005 شماره
صفحات -
تاریخ انتشار 2005